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Abstract 

Correlations of active and passive random intersection graphs are studied in this 



On 

| letter. We present the joint probability generating function for degrees of G acUve (n, m,p) 

£SJ ■ and G pasmve (n,m,p), which are generated by a random bipartite graph G*(n,m,p) 

' on n + to vertices. 
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CM 

q ; 1. Introduction 

Consider a set V with n vertices and another set of objects W with m objects. Define 
a bipartite graph G*(n,m,p) with independent vertex sets V and W. Edges between 
v £ V and w €W exist independently with probability p. The active random intersection 
graph G acttve {n, m,p) derived from G*(n, m,p) is defined on the vertex set V with vertices 
vi,V2 E V adjacent if and only if there exists some w E W such that both v\ and vi 
are adjacent to w in G* (n,m,p). Analogously, the passive random intersection graph 
G passme (n,m,p) is defined on the vertex set W with vertices wx,W2 E W adjacent if and 
CM | only if there is some v E V such that both w\ and W2 are adjacent to v in G*(n,m,p). 

The models of random intersection graphs defined above were first introduced in [U[9]. 
Observe that the degree of a given vertex v € V (or w G W, respectively) in G*(n, m,p) is 
binomial Bin(m,p) (or Bin(n,p), respectively) distributed, and by interchanging the roles 
of V and W, the active and passive graphs are dual to each other essentially. The random 
intersection graphs are generalized in [3] by allowing an arbitrary distribution for vertex 
degree in G*(n,m,p) instead of merely binomial. Extended in such a way, the active and 
passive graphs reveal different properties including degree distributions; see [3] for details. 
^ Intersection graphs are relationship graphs and widely applied in various fields such as 

investigation of secure wireless sensor networks pQ, modeling of social networks [8], and 
statistical tests in cluster analysis for non-metric data [3]. 

An interesting topic in the study of random intersection graphs regards the interrela- 
tions between the active and passive graphs. For example, if V and W represent researchers 
and research papers, respectively; and v £ V and w E W are adjacent if researcher v is 
an author of paper w. Thus, the resulting active graph is a collaboration graph on V of 
researchers, and accordingly, the passive graph is a relation graph on W of papers. The 
correlation of these two graphs shall shed light on the in-depth architectures and patterns 
in scientific collaboration [7J; e.g. in what way and to what extent, the appearance of 
certain configurations in G acttve (n,m,p) may influence the topology of G passive (n,m,p) 
and vice versa. 

In the current paper, as a first step towards this research, we explore the correlation of 
vertex degrees between G actlve (n,m,p) and G passive (n,m,p), and the corresponding joint 
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probability generating function is provided by exploiting the sieve method. Some related 
works dealing with degree distributions of random intersection graphs have been done. 
|1L)| treats the degree distribution of G acUve (n,m,p) with parameters m = \n a \ for some 
a > and p = ^/cn~^ 1+a ^ 2 for some c > 0, and the distribution of the degree of a given 
vertex is shown to converge to a point mass at 0, a Poisson distribution or a compound 
Poisson distribution depending on whether a < 1, a = 1 or a > 1. [4 J examines the 
sufficient and necessary conditions for generalized active random intersection graph to 
have a Poisson limiting degree distribution. The related conditions which imply a limiting 
degree distribution are given in [H] for generalized passive random intersection graph. To 
the best of our knowledge, our work is the first one devoted to joint degree distributions 
between the active and passive graphs. It is obvious that the number of edges in active 
and passive graphs should be positively correlated. 

The rest of the paper is organized as follows. The main result (Theorem 1) is presented 
and proved in Section 2. In Section 3, we conclude the paper with some further remarks. 

2. Joint probability generating function 

Let X be the number of vertices of ^\{i>} adjacent in G actme (n,m,p) to a vertex 
v £ V, and Y be the number of vertices of adjacent in G passme (n,m,p) to a 

vertex w G W. That is, X and Y are typical vertex degrees in G actwe (n,m,p) and 
G passive (n,m,p), respectively. The joint probability generating function of X and Y is 
defined to be F(x,y) := Ex x y Y for x,y G R. The main result in this paper is stated as 
follows. 

Theorem 1. The joint probability generating function F(x,y) is given by 

n*,v) = E E ( 7 ) ( m 7 ) ^ _1_fe ( 1 - *) v- 1 -^ - vf 

k=0 1=0 ^ ' ^ ' 

■ [1 - p + p(l - p) k ] m - 1 - 1 [1 - p + p(l - pf] M 

■ "(i - p) k+i P +(i-p)J2 (TV a - pf~ l [(! - p) i+1 + pi 1 - p) 1 ] k ~ ■ 

i=0 

Let F(x) and F{y) denote the probability generating functions for random variables 
X and Y, respectively. Since F(x) = F(x,l) and F(y) = F(l,y), we have the following 
corollary which is consistent with Theorem 1 in [10J. 

Corollary 1. The probability generating functions F{x) and F(y) are given by 

n—1 / _ -i\ 

f( x ) = E{ n k ) xM ( 1 - x ) k t 1 - p + pd - p) k ] m 

and 

m—l , _ i\ 

m = E ( m y 1 ™^ -y) l i 1 -p+ p(i - p) 1 ] n 

1=0 ^ ' 

respectively. 

To prove Theorem 1 , we will invoke a lemma which is a probability generating function 
version of the sieve method, whose proof is similar to Lemma 1 in [10]. In fact, the following 
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lemma can be viewed as a high dimensional extension of Lemma 1 in [TO]. Let P\ and 
P2 be two disjoint sets of properties that a random object can take on. Let pkj be the 
probability that the object takes on exactly k properties in Pi and I properties in Pi- The 
related probability generating function is defined as F(x,y) := l>0 pkjx k y l . 

Lemma 1. Given S\ C Pi and S2 C P2, we define N$ l ,s 2 to ^ e the event that the random 
object possesses properties S± and S^- Define 

N k y.= Yl P ( N SuS 2 ) and N{x,y) := £ N k>l x k y l . 
\Si\=k,\S2\=l k,l>0 

Then we have 

F(x,y) = N(x-l,y-l). 

Proof. Let Y\ and Y2 be the numbers of properties that the random object possesses in 
Pi and P2, respectively. Let 1n Si s 2 ' 3e * ne indicator function of event Ng lt s 2 - We clearly 
obtain 

' M (Y 2 
I 



|5i|=fc,|S 2 |=« \Si\=k,\S 2 \=l 

Hence, 

- e<©(?)>v-[eC;>*e(?) 

k,l>0 v v 7 v 77 L fc>0 v 7 />0 v 7 

= E[(x + l) Yl (y + l) Y *] =F(x + l,y + l). 

□ 

In the following proof, we will take Pi as the set of n — 1 properties consisting of the 
non-adjacency of a fixed vertex to the other n — 1 vertices in V; and take P2 as the set of 
m — 1 properties regarding W similarly. 

Proof of Theorem 1. For v € V and w 6 H 7 , let ; be the probability that exactly 
vertices in V\{v} are not adjacent to v and / vertices in Vr\{w;} are not adjacent to w. 
Let G(x, y) be the corresponding probability generating function. The probability that 
the fixed vertices v and w are adjacent to none of the vertices represented by Si C V\{v} 
and S2 C respectively is given by 

P(Nsi,S 2 ) = P(N Sl ,s 2 \v ~ w) ■ p + P{N Sl ,s 2 \v ^ w) ■ (1-p), 

where 

n-\Si\ m-\S 2 \ . I c |\ / 1 I a IN 

p ( ^, s >^) = g g ( m -/_-;*') ("-/_-; Si1 ) 

and 

I Si I n-l-|Si| |S 2 | m-l-\S a \ , ic IN /lo IN / 1 I a IN /I c IN 

n^M.) - £ E E E (» " 1" (f) C- 1 ,; 1811 )^ 1 ) 

.pJo+J»(l — p) m -l-io-ispio+«a n _ p\n-l~i -i s 

.(1 _ p\(jo+js)\Si\+(io+i s )\S 2 \-i s js _ ^2) 
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In the above expression ([T]), the index i counts the number of vertices of V adjacent to 
w in G*(n,m,p) and similarly, j counts the number of vertices of W adjacent to v in 
G*(n, m,p); c.f. Fig. 1. For the expression (£2§, i a + i s counts the number of vertices of V 
adjacent to w in G*(n,m,p), where i Q counts vertices outside S\ while i s inside. The roles 
of indices j Q and j s can be interpreted likewise; c.f. Fig. 2. The inclusion and exclusion 
principle is utilized here. 
Hence we have 



N, 



k,l 



n — 1\ / m — 1 
k \ I 



n—k m—l 

EE 

i=l 3=1 



m — 1 — l\ fn — 1 — k 



k n—l—k I m—l— I 

+ E E E E 

ia=0 io=0 js=0 jo=0 



n—i+(i—l)l 



P 



m — 1 — l\ ( l \ (n — 1 — k\ f k 

jo 



.pjo+jsn _ p^m-l-j -j s + (i +i s )kpi +i s _ ^^n-l-i -i s + (i +i s )/-i s j s /j 

n * ( m r x ) I [i - p + p(i - p)l] M{i -p^-p+p^- p) k ] 

.(1 _ p) l p + [i _ p + p (i _ [i _ p + p (i _ _ p) 

k I 

EE 

n — l\ / m — 1 

k 



k\ ( I 



[p(l - p) 1 ) ls {l- p) k ~ l ° [p(l - p) k ] Js (1 - p)*- J '«(l - p)-*'^ 

[1 - P + P(l - P) fc ] m ' x - 1 [1 " P + p(i - p)<] n ^ fc 



i=0 



(i - p) fc+ 'p + (i - p) £ . P *(i - p) z - [(i - P y +1 + p(i - p) z ] 



Consequently, Lemma 1 yields G(x,y) = N(x — l,y— 1), where 

Ti—l m—l 

N(x, y ) = EE^V 



k=0 1=0 
71—1 m—l 

E E *v 

£:=0 Z=0 



n — 1 \ / m — 1 



[1 _ p + p(i _ p)*] [i _ p + p(i _ p )i] 



n n—l—k 



i=0 



(i - p) fc+ 'p + (i - p) £ . p*(i - p) 1 -* [(i - P y +1 + p(i - p) z ] 



The result then follows from the fact that 

n—lm—l n—lm—1 



k=0 1=0 



k=0 1=0 



□ 



3. Concluding remarks 

In this letter, we study the joint probability generating function F(x, y) for degrees 
of G acttve (n,m,p) and G pass ' lve (n,m,p) by employing the sieve method. We mention that 



4 



our work is only a preliminary step to the interesting and meaningful topic of correlation 
of active/passive random intersection graphs. Corollary 1 gives the probability generating 
functions of the marginal distributions P(X = k) and P(Y = I), respectively. As is known, 
the random variables X and Y are independent if and only if F(x, y) = F(x)F{y) for all 
x, y (c.f. [2] pp. 279). Hence, we may tackle the interdependence between the active graph 
and passive graph through their probability generating functions. For the future research, 
the method developed in this paper may be applied to the joint distribution of degrees of 
generalized active and passive graphs in [3]. A more hard question is to investigate the 
joint distribution of clusters in these two models. 
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Figure 1: G*(n,m,p) in the case of v ~ w. Solid line represents edges and dashed line 
represents non-edges. 
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Figure 2: G*(n,m,p) in the case of v 96 w. Solid line represents edges and dashed line 
represents non-edges 
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